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1^ (54) Title: COMPOSITIONS AND METHODS FOR GENERATING EXPRESSION VECTORS THROUGH SITE-SPECIFIC 

I-*. RECOMBINATION 

00 

^ (57) Abstract: Compositions, kits, and methods are provided for use in a recombi national cloning or subcloning methods for con- 

\ structing expression vectors which comprise: ligating a library of double-stranded linear donor DNAs, where each member of the 
library includes a donor DNA sequence, with a double-stranded linear driver DNA which includes a promoter sequence and a donor 
recombination site to form a single circular donor DNA, the single circular donor DNA not including an origin of replication, where 

Q the donor DNA sequence is under the transcriptional control of the promoter, and contacting the circular donor DNA and a circular 
acceptor vector in the presence of a recombi nase to form a single fused circular vector, the circular acceptor vector comprising an 

^ origin of replication and an acceptor recombination site capable of recombining with the circular donor DNA. 
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COMPOSITIONS AND METHODS FOR GENERATING EXPRESSION 
VECTORS THROUGH SITE-SPECIFIC RECOMBINATION 

5 

Field of the Invention 

This invention relates to recombinant DNA technology, nucleic 
acids, vectors and methods for use in a recombinational cloning or 
subcloning, and more specifically for constructing expression vectors by 
1 0 using recombination proteins in vitro or in vivo through site-specific 
recombination. 

Description of Related Art 

Recombinant DNA technology, also called gene cloning or 
1 5 molecular cloning, is widely used to transfer genetic information, i.e. 
DNA, from one organism to another. A typical recombinant DNA 
experiment often follows the following procedure. First, the DNA (e.g., 
the cloned DNA, insert DNA, target DNA, or foreign DNA) from a donor 
organism is extracted, enzymatically cleaved (or cut/digested), and 
20 joined (ligated) to another DNA entity (e.g. a cloning vector) to form a 
new, recombinant DNA molecule (or cloning vector-insert DNA 
construct). Second, this cloning vector-insert DNA construct is 
transferred into and maintained within a host cell, such as 
transformation of a bacterial host cell by the construct. Third, those host 
25 cells that take up the DNA construct (transformed cells) are identified 
and selected from those that do not. In addition, if required, a DNA 
construct can be prepared to ensure that the protein product that is 
encoded by the cloned DNA sequence is produced by the host cell. 

30 Accordingly, this traditional cloning methods using restriction 

enzymes and ligase can be time consuming, especially when a specific 
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expression vector is required for transferring the target gene into a 
heterologous host cell, such as a mammalian cell. The specific 
expression vector may not contain matching restriction sites for the 
donor DNA. Extensive reengineering of the expression vector may be 
5 required to introduce the matching restriction sites into the vector so that 
the vector and the insert DNA can be ligated to produce the final 
construct. Alternatively, multiple restriction enzymes may have to be 
employed to generate an insert DNA having suitable restriction sites for 
ligation with the vector. In this case, reaction conditions for each 

1 0 restriction enzyme may differ such that it is often necessary to perform a 
few separate restriction digestion reactions to obtain the desired insert. 
Further, the efficiency of direct ligation between the vector and insert 
may be very low, especially between large fragments. As a result, the 
whole procedure is tedious, and the final yield of the correctly ligated 

15 construct can be low. 



Site-specific recombination represents another useful method of 
recombinant DNA technology. This method employs a site-specific 
recombinase, an enzyme which catalyzes the exchange of DNA 

20 segments at specific recombination sites. Site-specific recombinases 
present in some viruses and bacteria, and have been characterized to 
have both endonuclease and ligase properties. These recombinases, 
along with associated proteins in some cases, recognize specific 
sequences of bases in DNA and exchange the DNA segments flanking 

25 those segments. Landy, A. (1993) Current Opinion in Biotechnology 
3:699-707. 



A typical site-specific recombinase is Cre recombinase. Cre is a 
38-kDa product of the cre (cyclization recombination) gene of 
30 bacteriophage P1 and is a site-specific DNA recombinase of the Int 
family. Sternberg, N. etal. (1986) J. Mol. Biol. 187: 197-212. Cre 
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recognizes a 34-bp site on the P1 genome called loxP (locus of X-over 
of P1) and efficiently catalyzes reciprocal conservative DNA 
recombination between pairs of loxP sites. The loxP site consists of two 
13-bp inverted repeats flanking an 8-bp nonpalindromic core region. 
5 Cre-mediated recombination between two directly repeated loxP sites 
results in excision of DNA between them as a covalently closed circle. 
Cre-mediated recombination between pairs of loxP sites in inverted 
orientation will result in inversion of the intervening DNA rather than 
excision. Breaking and joining of DNA is confined to discrete positions 
10 within the core region and proceeds on strand at a time by way of 

transient phophotyrosine DNA-protein linkage with the enzyme. Other 
examples of site-specific recombination systems include the 
integrase/att system form bacteriophage X and the FLP/FRT system 
from the Saccharvmyces cerevisiae 2pi circle plasmid. 

15 

These site-specific recombination systems have been used in 
vivo to facilitate recombination between different vectors. Waterhouse 
et al. used an in vivo method to join light and heavy chains of an 
antibody. The light and heavy chains were cloned in different phage 

20 vectors between loxP and loxP 511 sites that were used to transform 
new E. coli cells. Waterhouse, P. et al. (1993) Nucleic Acid Res. 
21:2265-2266. Cre acted on two parental molecules, one plasmid and 
another phage, in the host cells to produce four products in equilibrium: 
two different cointegrates (produced by recombination at either loxP or 

25 loxP51 1 sites), and two daughter molecules, one of which was the 
desired product. Schlake and Bode used an in vivo method to 
exchange expression cassettes at defined chromosomal locations, each 
flanked by a wild type and spacer-mutated FRT recombination site. 
Schlake and Bode (1994) Biochemistry 33:12746-12751. A double- 

30 reciprocal crossover was mediated in cultured mammalian cells by using 
the FLP/FRT system for site-specific recombination. Aoki et al. used a 
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shuttle plasmid (pAdMCS) that carried a gene of interest, a loxP site, the 
adenoviral 5-LTR and packaging signal 0 to 1 mu, and a multiple cloning 
site. Aoki et al. (1999) Mol. Med. 5:224-231. The shuttle plasmid was 
linearized by a restriction enzyme Nhel and recombined with Clal- 
5 digested adenoviral cosmid in vitro. Cre recombinase produced the full- 
length recombinant adenoviral vector in vitro by an exchange of region 
distal to the loxP site linearized in these two molecules. 

10 SUMMARY OF THE INVENTION 

The present invention relates to compositions, kits, and methods 
for use in a recombinational cloning or subcloning. In particular, the 
present invention provides novel methods for constructing expression 
15 vectors by using site-specific recombinases in vitro. These method may 
be used for high throughput screening of genes, functional genomics 
and other human genome projects. 

In one aspect, the present invention provides a double-stranded 
20 circular donor DNA for transferring a donor DNA sequence into 

expression vectors. The circular donor DNA comprises: a donor DNA 
sequence; a donor recombination site; at least one selectable marker, 
the circular donor DNA not including an origin of replication. 

25 The donor DNA sequence may be any gene of interest or any 

synthetic DNA sequence which is needed to be transferred into an 
expression vector. For example the donor DNA segment may be a 
sequence derived from cDNA of a particular gene or one of the 
members of a cDNA library. The donor DNA may also be a genomic 

30 DNA that contains the coding region interrupted with non-coding 
sequences. 
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In another aspect, the present invention also provides a library of 
double-stranded circular donor DNAs that may be used for high 
throughput screening. The library of double-stranded circular DNA 
5 comprises: a donor DNA sequence which varies within a library of donor 
DNA sequences; a donor recombination site; and at least one selectable 
marker, the circular donor DNA not including an origin of replication. 

The library of donor DNA sequences may be a library of cDNA or 
10 genomic DNA derived from any desirable sources. For example, the 
library of donor DNA sequences may be a cDNA library from single 
human chromosomes. 



The circular donor DNA may further comprise a promoter 
1 5 sequence that controls expression of the donor DNA sequence. The 
promoter may be any array of DNA sequences that interact specifically 
with cellular transcription factors to regulate transcription of the 
downstream gene. The promoter may be derived from any organism, 
such as bacteria, yeast, insect and mammalian cells and viruses. 
20 Examples of the promoter include, but are not limited to, E. coli lac and 
tip operons, the tac promoter, the bacteriophage X p 1 promoter, 
bacteriophage T7 and SP6 promoters, p-actin promoter, insulin 
promoter, human cytomegalovirus (CMV) promoter, HIV-LTR (HIV-long 
terminal repeat), Rous sarcoma virus RSV-LTR, simian virus SV40 
25 promoter, baculoviral polyhedrin and p10 promoter. 



The promoter may also be an inducible promoter that regulates 
the expression of downstream gene in a controlled manner. Examples 
of inducible promoters include, but are not limited to, the bacterial dual 
30 promoter (activator/repressor expression system) which regulates gene 
expression in mammalian cells under the control of tetracycline and its 
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analogs and promoters that regulate gene expression under the control 
of factors such as heat shocks, steroid hormones, heavy metals, phorbol 
ester, the adenovirus E1A element, interferon, or serum. 

5 The donor recombination site may be any segment or arrays of 

DNA sequence recognized by a site-specific recombinase which 
catalyzes site-specific fusion between the circular donor DNA and an 
acceptor vector. The site-specific recombinase may be a recombinase, 
a transposase or an integrases. 

0 

In one variation, the recombination site is a lox site that is 
recognized by the Cre recombinase of bacteriophage PI. Example of lox 
site includes, but are not limited to, loxB, loxL, loxR, loxP [SEQ ID 
NO:1], loxP3, loxP23, loxA86, loxA117, loxP511 [SEQ ID NO:2], and 
5 loxC2 [SEQ ID NO:3]. 

In another variation, the recombination site is a recombination 
site that is recognized by a recombinases other than Cre. Examples of 
the non-Cre recombinases include, but are not limited to, site-specific 

0 recombinases include: att sites recognized by the Int recombinase of 
bacteriophage X (e.g. att1, att2, att3, attP, attB, attL, and attR), the FRT 
sites recognized by FLP recombinase of the 2pi plasmid of 
Saccharomyces cerevisiae, the recombination sites recognized by the 
resolvase family, and the recombination site recognized by transposase 

:5 of Bacillus thruingiensis. 

The example of site-specific recombinase include, but are not 
limited to, bacteriophage P1 Cre recombinase, yeast FLP recombinase, 
Inti integrase, bacteriophage X, phi 80, P22, P2, 186, and P4 
\0 recombinase, Tn3 resolvase, the Hin recombinase, and the Cin 

recombinase, E. co//xerC and xerD recombinases, Bacillus thuringiensis 
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recombinase, Tpnl and the p-lactamase transposons, and the 
immunoglobulin recombinases, 

The selectable marker of the circular donor DNA may be any 
5 functional element for facilitating subsequent identification and selection 
of clones of the recombination product under suitable conditions. The 
selectable marker may encode any ftinctional element, such as protein, 
peptide, RNA, binding site for RNA and proteins, or products that 
provide resistance to organic or inorganic agents. Examples of 

10 selectable markers include, but are not limited to, reporter genes such 
as O-galactosidase (GAL), fluorescent proteins (e.g., GFP, GFP-UV, 
EFFP, BFP, EBFP, ECFP, EYFP), secreted form of human placental 
alkaline phosphatase (SEAP), p-glucuronidase (GUS)); resistance 
genes against antibiotics (e.g. neomycin (G418) or hygromycin resistant 

15 gene, puromycin resistant gene), yeast seletable markers Ieu2-d and 
URA3, apoptosis resistant genes (e.g. baculoviral p35 gene), and 
antisenoligonucleotides. 

The circular donor DNA may optionally include an affinity tag for 
20 selection and isolation of protein product encoded by the donor DNA 
segment. Examples of such an affinity tag include, but are not limited 
to; a polyhistidine tract, polyarginine, glutathione-S- transferase (GST), 
maltose binding protein (MBP), a portion of staphylococcal protein A 
(SPA), and various immunoaffinity tags (e.g. protein A) and epitope tags 
25 such as those recognized by the EE (Glu-Glu) antipeptide antibodies. 
The affinity tag may be positioned at either the amino- or carboxy- 
terminus of the donor DNA. 

The present invention also provides a circular acceptor vector for 
30 generating recombinant expression vector. The vector comprises an 
origin of replication; and an acceptor recombination site capable of 
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recombining with a donor DNA. Optionally, the acceptor vector may not 
include a promoter for regulating expression of the donor DNA. 

The circular acceptor vector may be any vector that can 
5 transform, transfect or transduce a host cell. The acceptor vector may 
be a plasmid, a phage or a viral vector as long as it is able to replicate in 
vitro or in a host cell, or to convey the donor DNA to a desired location 
within a host cell. Examples of host cells include, but are not limited to, 
bacterial (e.g. E. coli, Bacillus subtilis, etc.), yeast, animal, plant, and 
10 insect cells. 

In one variation, the circular acceptor vector may be a prokaryotic 
plasmid. Optionally, the acceptor vector may comprise a prokaryotic 
termination sequence. Examples of the prokaryotic termination 
15 sequence include, but are not limited to, the T7 termination sequence, 
the Tint, T u , T^, T l3 , TR 1( TR 2 , termination signals derived from the 
bacteriophage X. 

In another variation, the circular acceptor vector may be a 
20 mammalian expression vector. The mammalian expression vector 

contains one or more eukaryotic marker genes, appropriate eukaryotic 
transcriptional and translational termination signals and a sequence that 
signals polyadenylation of the transcript messenger RNA (mRNA), and 
an origin of replication that functions in a mammalian host cell. 
25 Examples of the eukaryotic polyadenylation sequence include, but are 
not limited to, the Herpes simplex virus thymidine kinase 
polyadenylation sequence, the bovine growth hormone polyadenylation 
sequence, and the simian virus 40 polyadenylation sequence. 

30 Optionally, the eukaryotic expression vector may also carry an 

origin of replication and selectable marker genes that function in 
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bacterial cells, forming a shuttle vector. 

In yet another variation the circular acceptor includes a promoter 
for regulating expression of the donor DNA sequence carried by a 
5 circular donor DNA of the invention. According to this variation, the 
recombination site may be placed downstream of the promoter and the 
transcription initiation site in the acceptor vector. 

In yet another variation, the circular acceptor may be a yeast 
10 expression vector such as a S. cerevisiae expression vector. Various 
types of S. cerevisiae expression vector include, but are not limited to, 
episomal or plasmid vector, integrating vectors, and yeast chromosomes 
(YACs). 

15 In yet another variation, the circular acceptor vector may be a 

baculovirus DNA, such as wild type or mutant genomes of Autographa 
califomica multiple nuclear polyhedrosis virus (AcMNPV) virus. 

Optionally, a baculovirai acceptor vector according to the present 
20 invention may not contain a polyhedrin promoter. Instead, the 

polyhedrin or the baculovirai p10 promoter can be positioned upstream 
of the donor DNA sequence of the circular donor DNA of the present 
invention. 

25 The present invention also provides kits for generating 

recombinant vectors. In one embodiment, the kit comprises: a double- 
stranded circular donor DNA comprising a donor DNA sequence, a 
donor recombination site, and at least one selectable marker, the 
circular donor DNA not including an origin of replication; and a circular 

30 acceptor vector comprising an origin of replication and an acceptor 

recombination site capable of recombining with the circular donor DNA. 
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In another embodiment, the kit comprises: a library of double- 
stranded circular donor DNA comprising a donor DNA sequence which 
varies within a library of donor DNA sequences, a donor recombination 
5 site, and at least one selectable marker, the circular donor DNA not 
including an origin of replication; and a circular acceptor vector 
comprising an origin of replication and an acceptor recombination site 
capable of recombining with the circular donor DNA. 

10 In yet another embodiment, the kit comprises: one or more linear 

donor DNA comprising a donor DNA sequence; a linear driver DNA 
comprising a promoter sequence, a recombination site, and at least one 
selectable marker, ligation of the linear donor DNA and the linear driver 
DNA resulting in a circular donor DNA; and a circular acceptor vector 

15 comprising an origin of replication and an acceptor recombination site 
capable of recombining with the circular donor DNA. 

The present invention also provides a method for generating 
recombinant expression vector in vitro through site-specific 

20 recombination between a circular donor DNA and circular acceptor 
DNA, each containing recombination site recognized by the 
recombinase. The method comprises: contacting a circular double- 
stranded donor DNA and a circular acceptor vector in the presence of a 
recombinase under conditions suitable for the circular double-stranded 

25 donor DNA and circular acceptor vector to recombine to form a single 

fused circular vector. In this method, the circular double-stranded donor 
DNA comprises a donor DNA sequence, a donor recombination site, 
and at least one selectable marker, but not including an origin of 
replication. The circular acceptor vector comprises an origin of 

30 replication and an acceptor recombination site capable of recombining 
with the circular donor DNA. The promoter for regulating expression of 
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the donor DNA may be contained in either the donor DNA or acceptor 
vector. 

According to this method, the circular donor DNA containing a 
5 site-specific recombination site may be recombined with a circular 

acceptor vector in the presence of Cre recombinase. The recombination 
sites on the circular donor DNA and the circular acceptor vector may 
each contain a lox site. 

1 0 The method may further include steps of transforming, 

transfecting or transducing a host cell and selecting the correctly fused 
recombinant vector based on the selectable phenotype conferred by the 
selectable marker gene on the recombinant vector. 

15 The present invention also provides a method for generating 

recombinant expression vectors from linear DNA segments in vitro. The 
method comprises: ligating one or more double-stranded linear donor 
DNA which includes a donor DNA sequence with a double-stranded 
linear driver DNA which includes a promoter sequence and a donor 

20 recombination site to form a single circular donor DNA, the singular 
circular donor DNA not including an origin of replication, where the 
donor DNA sequence is under the transcriptional control of the 
promoter; and contacting the circular donor DNA and a circular acceptor 
acceptor vector in the presence of a recombinase to form a single fused 

25 circular vector. In this method, the circular acceptor vector comprises 
an origin of replication and an acceptor recombination site capable of 
recombining with the circular donor DNA. 

According to this method, the linear donor DNA and linear driver 
30 DNA may contain matching restriction sites or other type of annealing 
sites so as to be ligated to form a circulaized DNA. The linear donor 
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and driver DNAs may be derived from PCR amplification products. 

The present invention also provides a method for high throughput 
production of recombinant expression vectors from linear DNA 
5 segments in vitro. The method comprises: ligating a library of double- 
stranded linear donor DNAs, where each member of the library includes 
a donor DNA sequence, with a double-stranded linear driver DNA which 
includes a promoter sequence and a donor recombination site to form a 
single circular donor DNA, the singular circular donor DNA not including 

10 an origin of replication, where the donor DNA sequence is under the 

transcriptional control of the promoter; and contacting the circular donor 
DNA and a circular acceptor acceptor vector in the presence of a 
recombinase to form a single fused circular vector. In this method, the 
circular acceptor vector comprises an origin of replication and an 

1 5 acceptor recombination site capable of recombining with the circular 
donor DNA. 

According to this method, the library of double-stranded linear 
donor DNAs may be DNAs amplified from a library of cDNA clones. The 
20 library of cDNA clones may be arrayed in a multi-well plate such as 96- 
and 384-well plates. The library of cDNA clones may be a cosmid or 
phage library. 

Also according to the method, ligating the library of double- 
25 stranded linear donor DNAs with a double-stranded linear driver DNA 
may be performed by Ligation Independent Cloning (LIC). Alternatively, 
ligating the library of double-stranded linear donor DNAs with a double- 
stranded linear driver DNA may be performed in the presence of T4 
DNA ligase. 

30 

The method may further include a step of transferring the 
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recombinant expression vector into a host and isolating the protein 
expressed from the vector by affinity tagging. The affinity tagging may 
be based on a polyhistidine tag (e.g. His 6 ), a protein tag (e.g., GST, 
maltose binding protein) or an epitope tag (e.g. an EE ag). 

5 

The methods of the present invention allow rapid and efficient 
generation of expression vectors containing the gene of interest without 
bacterial cloning. Direct ligation of linear donor DNA and linear driver 
DNA to generate a circular donor DNA allows for efficient cloning of 
10 donor DNA such as a cDNA library into an expression vector in an 

automated and high throughput manner. The methods can be used in a 
wide variety of high throughput arrays for functional genomics, protein 
genomics (proteomics), and other human genome projects. 

15 BRIEF DESCRIPTION OF FIGURES 

Figure 1 illustrates a process of constructing a library of 
baculoviral expression vectors through Cre-mediated site-specific 
recombination. 

20 

Figure 2 illustrates a process of constructing a baculoviral 
expression vector for the GUS gene through Cre-mediated site-specific 
recombination. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides reagents, kits and methods for 
use in a recombinational cloning or subcloning process, and, in 
30 particular, for constructing expression vectors by using a site-specific 
recombinase in vitro or in vivo. In one aspect, a method is used to 
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directly fuse a linear segment of donor DNA (e.g., gene of interest) with 
another linear segment of DNA comprising various functional elements 
such as promoters, selectable markers and a recombination site, 
resulting in a single circular donor DNA. This circular donor DNA is then 
5 recombined into a circular acceptor vector which also contains a 

recombination site through site-specific recombination catalyzed by a 
recombinase. The recombination product can be used to transform, 
transfect or transduce various types of host cells, depending on the 
specific type of acceptor vector used. 

10 

The circular donor DNA need not carry an origin of replication for 
propagation in host cells such as bacterial cells. Instead, the circular 
donor DNA may be produced from directly ligating two or more linear 
segments of DNA which may be amplified by polymerase chain reaction 

1 5 (PCR). Such a separation and ligation of different segments of DNA 
allows flexible distribution of different elements among the linear 
segments. For example, one linear segment may contain the gene of 
interest amplified from a cDNA library, while the other linear segment 
contains functional elements essential for subsequent recombination in 

20 vitro (or in vivo) and expression in host cells. By using a site-specific 
recombinase, such as Cre recombinase, this circular DNA can be 
recombined into any gene-transferring vector without using restriction 
enzymes as long as the vector carries a recombination site recognized 
by the recombinase. Further, direct ligation of linear segments of DNA 

25 avoids laborious steps of bacterial cloning and facilitates high 

throughput screening of large library of genetic materials, such as cDNA 
libraries derived from diseased tissues or cells. In addition, the circular 
DNA produced by direct ligation of these segments can be free of other 
undesirable genetic materials such as "junk DNA" derived from a 

30 bacterial plasmid that may affect expression, viability or stability of the 
recombinant vector. 
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1 . Circular Donor DNA 

The present invention provides a double-stranded circular donor 
DNA for transferring a donor DNA sequence into expression vectors. 
The circular donor DNA comprises: a donor DNA sequence; a donor 
recombination site; at least one selectable marker, the circular donor 
DNA not including an origin of replication. The donor DNA sequence 
may be any gene of interest which is needed to be transferred into an 
expression vector. 

The present invention also provides a library of double-stranded 
circular donor DNAs which comprises: a donor DNA sequence which 
varies within a library of donor DNA sequences; a donor recombination 
site; and at least one selectable marker, the circular donor DNA not 
including an origin of replication. 

The circular donor DNA contains a donor DNA segment (either 
cDNA or genomic DNA), a promoter (e.g. SV40 early gene enhancer), a 
selectable marker (e.g. Neo gene), and a sequence-specific 
recombinase target site (e.g. a loxP site). The promoter controls 
expression of the gene of interest and the selectable marker gene when 
the circular donor DNA is recombined with an acceptor vector and the 
resulting recombinant vector is introduced into a host cell. The circular 
donor DNA may further contain a polyadenylation signal for expression 
in mammalian cells. 

The donor DNA sequence may be any deoxyribonucleotide 
sequence encoding a functional gene or any synthetically generated 
DNA sequence. For example the donor DNA segment may be a 
sequence derived from cDNA of a particular gene or one of the 
members of a cDNA library. The cDNA library may be produced by 
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converting mRNAs in a sample into double-stranded complementary 
DNA (cDNA) by using reverse transcriptase (RT) and the Klenow 
fragment of nucleic acid polymerase I. Depending on the source of 
mRNA sample, the cDNA library may contain various populations of 
5 genes of interest, such as disease genes located in certain tissue or 

type of cells. The donor DNA may also be a genomic DNA that contains 
the coding region interrupted with non-coding sequences 
(introns/intervening sequences). These introns may contain regulatory 
elements such as enhancers. 

10 

The circular donor DNA may further comprises a promoter 
sequence that controls expression of the donor DNA sequence. The 
promoter may be any array of DNA sequences that interact specifically 
with cellular transcription factors to regulate transcription of the 

15 downstream gene. The promoter may be derived from any organism, 
such as bacteria, yeast, insect and mammalian cells and viruses. The 
selection of a particular promoter depends on what cell type is to be 
used to express the protein of interest. Examples of the promoter 
include, but are not limited to, E. colilac and tip operons, the tac 

20 promoter, the bacteriophage X p l promoter, bacteriophage 77 and SP6 
promoters, B-actin promoter, insulin promoter, human cytomegalovirus 
(CMV) promoter, HIV-LTR (HIV-long terminal repeat), Rous sarcoma 
virus RSV-LTR, simian virus SV40 promoter, baculoviral polyhedrin and 
p10 promoter. The promoter may also be an inducible promoter that 

25 regulates the expression of downstream gene in a controlled manner, 
such as under a specific condition of the cell culture. Examples of 
inducible promoters include, but are not limited to, the bacterial dual 
promoter (activator/repressor expression system) which regulates gene 
expression in mammalian cells under the control of tetracyclines 

30 (Gossen, M. and Bujard, H. 1992, Proc.Natl. Acad. Sci. USA, 89, 5547- 
5551) and promoters that regulate gene expression under the control of 
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factors such as heat shocks, steroid hormones, heavy metals, phorbol 
ester, the adenovirus E1A element, interferon, or serum. 

The recombination site may be any segment or arrays of DNA 
sequence recognized by site-specific recombinase which catalyzes site- 
specific fusion between the circular donor DNA and the acceptor vector. 
The site-specific recombinase may be any enzymes that recognize 
short DNA sequences that become the crossover regions during the 
recombination event, including but not limited to recombinases, 
transposases and integrases. 

Site-specific recombinases may derived from prokaryotic and 
eukaryotic sources. Examples of site-specific recombination include 1) 
chromosomal rearrangements which occur in Salmonella typhimurium 
during phase variation, inversion of the FLP sequence during the 
replication of the yeast 2 ji circle and in the rearrangement of 
immunoglobulin and T cell receptor genes in vertebrates, 2) integration 
of bacteriophages into the chromosome of prokaryotic host cells to form 
a lysogen and 3) transposition of mobile genetic elements (e.g., 
transposons) in both prokaryotes and eukaryotes. 

In one embodiment, the recombination site is a loxP site that is 
recognized by the Cre recombinase of bacteriophage PI. The Cre 
recombinase catalyzes recombination of DNA between two loxP sites. 
The loxP site consists of a double-stranded 34 bp sequence: 

5 , -ATAACTTCGTATAATGTATGCTATACGAAGTTAT-3 , 
3 , -TATTGAAGCATATTACATACGATATGCTTCAATA-5 , 

(SEQ ID NO:1) 
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The loxP site comprises two 13 bp inverted repeat sequences separated 
by an 8 bp spacer region. Hoess et al. (1982) Proc. Natl. Acad. Sci. 
USA 79:3398. The internal spacer sequence of the loxP site is 
asymmetrical and thus, two loxP sites can exhibit directionality relative 
5 to one another. Hoess et al. (1984) Proc. Natl. Acad. Sci. USA 81 :1026. 
When two loxP sites on the same DNA molecule are in a directly 
repeated orientation, Cre excises the DNA between these two sites 
leaving a single loxP site on the DNA molecule. Abremski et al. (1983) 
Cell 32:1301. If two loxP sites are in opposite orientation on a single 
10 DNA molecule, Cre inverts the DNA sequence between these two sites 
rather than removing the sequence. 

The Cre recombinase also recognizes a number of variant or 
mutant lox sites relative to the loxP sequence. Examples of these Cre 
15 recombination sites include, but are not limited to, the loxB, loxL and 
loxR sites which are found in the E. coli chromosome. Hoess et al. 
(1982), supra. Other variant lox sites include: 

loxP511 site: 5 , -ATAACTTCGTATAGTATACATTATACGAAGTTAT-3 , 
20 (SEQ ID NO:2); 

Hoess etal. (1986) Nucleic Acid Res. 14:2287-2300, 

loxC2 site: 5'-ACAAC TTCGTATAATGTATGCTATACGAAGTTAT-3' 
(SEQ ID NO:3) 
25 U.S. Pat. No. 4,959,317. 

Cre catalyzes the cleavage of the lox site within the spacer region 
and creates a six base-pair staggered cut. Hoess and Abremski (1985) 
J. Mol. Biol. 181:351 . The two 13 bp inverted repeat domains of the lox 
30 site represent binding sites for the Cre protein. If two lox sites differ in 
their spacer regions in such a manner that the overhanging ends of the 
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cleaved DNA cannot reanneal with one another, Cre may not efficiently 
catalyze a recombination event using the two different lox sites. For 
example, it has been reported that Cre could not efficiently recombine a 
loxP site and a loxP51 1 site; these two lox sites differ in the spacer 
5 region. Two lox sites which differ due to variations in the binding sites 
(i.e., the 13 bp inverted repeats) may be recombined by Cre provided 
that Cre can bind to each of the variant binding sites; the efficiency of 
the reaction between two different lox sites (varying in the binding sites) 
may be less efficient that between two lox sites having the same 
1 0 sequence (the efficiency will depend on the degree and the location of 
the variations in the binding sites). For example, the loxC2 site can be 
efficiently recombined with the loxP site; these two lox sites differ by a 
single nucleotide in the left binding site. 

15 The Cre protein has been purified to homogeneity. Abremskiet 

al. (1984) J. Mol. Biol. 259:1509. And the cre gene has been cloned 
and expressed in a variety of host cells. Abremski et al. (1983), supra. 
Purified Cre protein is available from a number of suppliers (e.g., 
Novagen and New England Nuclear/Du Pont). 

20 

The recombination site of the circular DNA may also be selected 
from a variety of other recombination sites recognized by recombinases 
other than Cre. Examples of the non-Cre recombinases include, but are 
not limited to, site-specific recombinases include: the Int recombinase 
25 of bacteriophage □, the FLP recombinase of the 2pi plasmid of 
Saccharomyces cerevisiae, the resolvase family, transposase of 
Bacillus thruingiensis. 

The Int recombinase of bacteriophage X belongs to the integrase 
30 family and mediates the integration of the X genome into the E. coli 
chromosome. The Int recombinase of bacteriophage A. promotes 
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irreversible recombination between its substrate aft sites as part of the 
the formation or induction of a lysogenic state. Landy, A., (1989) Ann. 
Rev. Biochem. 58:913. Reversibility of the recombination reactions 
results from two independent pathways for integrative and excisive 
5 recombination. Each pathway uses a unique but overlapping set of the 
15 protein binding sites that comprise aft site DNAs. Cooperative and 
competitive interactions involving four proteins (Int, Xis, IHF and FIS) 
determine the direction of recombination. Integrative recombination 
involves the Int and IHF proteins and sites attP (240 bp) and attB (25 

10 bp). Recombination results in the formation of two new sites: attL and 
attR. Excisive recombination requires Int, IHF, and Xis, and sites attL 
and attR to generate attP and aftB. Under certain conditions, FIS 
stimulates excisive recombination. In addition to these normal reactions, 
it should be appreciated that attP and attB, when placed on the same 

15 molecule, can promote excisive recombination to generate two excision 
products, one with attL and one with attR. Similarly, intermolecular 
recombination between molecules containing attL and attR, in the 
presence of Int, IHF and Xis, can result in integrative recombination and 
the generation attP and attB. Derivatives of the att site with changes 

20 within the 1 5 bp core may also be suitable for efficient recombination. 
By incorporating a native or modified att site in both the circular donor 
DNA and the acceptor vector, intermolecular recombination between the 
donor and acceptor DNA molecules may be achieved by using the 
appropriate recombination protein such as Int, IHF and FIS, with or 

25 without Xis. Integrase can be obtained as described by Nash, H. A., 
(1983) Methods of Enzymology 100:210-216. IHF can be obtained as 
described by Filutowicz, M., et al., (1994) Gene 147:149-150. 

The other members of the integrase family of site-specific 
30 recombinases may also be used to provide alternative recombination 

proteins and recombination sites for the present invention. Examples of 
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such Int recombinases include, but not limited to, site-specific 
recombinase encoded by bacteriophage X, phi 80, P22, P2, 186, P4. 
This group of recombinases exhibits a large diversity of sequences, but 
all of the recombinases can be aligned in their C-terminal halves. A 40- 
5 residue region near the C terminus is particularly well conserved in all 
the proteins and is homologous to a region near the C terminus of the 
yeast 2\i plasmid Flp protein. Three positions are perfectly conserved 
within this family: histidine, arginine and tyrosine are found at respective 
alignment positions 396, 399 and 433 within the well-conserved C- 
1 0 terminal region. These residues contribute to the active site of this family 
of recombinases, and suggest that tyrosine-433 forms a transient 
covalent linkage to DNA during strand cleavage and rejoining. Argos, P. 
et al., (1986) EMBO J. 5:433-40. 

1 5 The FLP recombinase of the 2pi plasmid of Saccharomyces 

cer&visiae recognizes the frt site which, like the loxP site, comprises two 
13 bp inverted repeats separated by an 8 bp spacer 

5 , -GAAGTTCCTATTCTCTAGAAAGTATAGGAACTTC-3 , 
20 (SEQ ID NO:4) 

Cox (1983) Proc. Natl. Acad. Sci. USA 80:4223. 

The FLP gene has been cloned and expressed in E. coli and in 
mammalian cells and has been purified. Meyer-Lean et al. (1987) 
25 Nucleic Acids Res. 15:6469; Babineau et al (1985) J. Biol. Chem. 
260:12313; Gronostajski and Sadowski (1985) J. Biol. Chem. 
260:12328. 

The resolvase family members, such as the Tn3 resolvase, the 
30 Hin recombinase, and the Cin recombinase, may also be used for 

recombination between the circular donor DNA and the circular acceptor 
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DNA. Although members of this highly related family of recombinases 
are typically constrained to intramolecular reactions (e.g., inversions and 
excisions) and can require host-encoded factors. Mutants have been 
isolated that relieve some of the requirements for host factors as well as 
5 some of the constraints of intramolecular recombination. Maeser and 
Kahnmann (1991) Mol. Gen. Genet. 230:170-176. 



Transposase of Bacillus thuringiensis may also be used as 
recombination proteins and recombination sites. Bacillus thuringiensis is 

10 an entomopathogenic bacterium whose toxicity is due to the presence in 
the sporangia of A-endotoxin crystals active against agricultural pests 
and vectors of human and animal diseases. Most of the genes coding 
for these toxin proteins are plasmid-bome and are generally structurally 
associated with insertion sequences (IS231, IS232, IS240, ISBT1 and 

1 5 ISBT2) and transposons (Tn4430 and Tn5401). Several of these mobile 
elements have been shown to be active and participate in the crystal 
gene mobility, thereby contributing to the variation of bacterial toxicity. 
Structural analysis of the iso-IS231 elements indicates that they are 
related to IS1151 from Clostridium perfringens and distantly related to 

20 IS4 and IS1 86 from £ coli. Like the other IS4 family members, they 
contain a conserved transposase-integrase motif found in other IS 
families and retroviruses. Functional data gathered from IS231A in E. 
coli indicate a non-replicative mode of transposition, with a preference 
for specific targets. Similar results were also obtained in Bacillus subtilis 

25 and B. thuringiensis. Mahillon, J. et al., (1994) Genetica 93:13-26; 
(1992) Campbell, J. Bacterid. 7495-7499. 



Other recombination systems may also be used as recombination 
proteins and recombination sites, including the xerC and xerD 
30 recombinases of E. coli which together form a recombinase that 
recognizes the 28 bp dif site (Leslie and Sherratt (1995) EMBO J. 
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14:1561); the Int protein from the conjugative transposon Tn916 (Lu and 
Churchward (1994) EMBO J. 13:1541); Tpnl and the p-lactamase 
transposons (Levesque (1990) J. Bacterid. 172:3745); and the 
immunoglobulin recombinases (Malynn et al. (1988) Cell 54:453). 

Other than the wild-type recombination sites, modified 
recombination sites may also be used In the present invention. Wild-type 
recombination sites may contain sequences that reduce the efficiency or 
specificity of recombination reactions. For example, multiple stop 
codons in attB, attR, attP, attL and loxP recombination sites occur in 
multiple reading frames on both strands, thereby reducing 
recombination efficiencies. For example att sites, such as att1, att2, and 
att3 sites, may be modified to have one or multiple mutations to 
enhance specificity or efficiency of the recombination reaction and to 
decrease reverse reaction by removing P1and H1 from attB. 

The circular donor DNA also contains one or more selectable 
markers to facilitate subsequent identification and selection of clones of 
the recombination product under suitable conditions. The selectable 
marker may encode any functional element, such as protein, peptide, 
RNA, binding site for RNA and proteins, or products that provide 
resistance to organic or inorganic agents. Examples of selectable 
markers include, but are not limited to, reporter genes such as p- 
galactosidase (GAL), fluorescent proteins (e.g., GFP, GFP-UV, EFFP, 
BFP, EBFP, ECFP, EYFP), secreted form of human placental alkaline 
phosphatase (SEAP), p-glucuronidase (GUS)); resistance genes that 
encodes products which provide resistance against other wise toxic 
agents such as antibiotics (e.g. neomycin (G418) or hygromycin 
resistant gene, puromycin resistant gene), yeast seletable markers Ieu2- 
d and URA3, apoptosis resistant genes (e.g. the baculoviral p35 gene) 
that encode proteins that binds to products which are detrimental to cell 
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survival and promote apoptosis; antisenoligonucleotides, and any other 
DNA that encodes product that directly or directly confer sensitivity of 
cells to particular agents. 

5 The circular donor DNA may optionally include an affinity tag for 

selection and isolation of protein product encoded by the donor DNA 
segment. Examples of such an affinity tag include, but are not limited 
to, a polyhistidine tract, polyarginine, glutathione-S-transferase (GST), 
maltose binding protein (MBP), a portion of staphylococcal protein A 

1 0 (SPA), and various immunoaffinity tags (e.g. protein A) and epitope tags 
such as those recognized by the EE (Glu-Glu) antipeptide antibodies. 
Th affinity tag may also be a signal peptide either native or heterologous 
to baculovirus, such as honey bee mellitin signal peptide. The affinity 
tag may be positioned at either the amino- or carboxy-terminus of the 

15 donor DNA. 

2. Circular Acceptor Vector 

The present invention also provides a circular acceptor vector for 
20 generating recombinant expression vector. The vector comprises an 
origin of replication; and an acceptor recombination site capable of 
recombining with a donor DNA. Optionally, the acceptor vector may not 
include a promoter for regulating expression of the donor DNA. 

25 The circular acceptor vector may be any vector that can 

transform, transfect or transduce a host cell. The acceptor vector 
comprises a recombination site which is recognized by a site-specific 
recombinase and recombined with a donor DNA carrying another 
recombination site. The acceptor vector may be plasmids, phages or 

30 viral vectors as long as it is able to replicate in vitro, or in a host cell, or 
to convey the donor DNA to a desired location within a host cell. 
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Examples of host cells include, but are not limited to, bacterial (e.g. E. 
coli, Bacillus subtilis, etc.), yeast, animal, plant, and insect cells. 

For plasmid-based expression vector, the recombination site may 
5 be introduced into the vector by a double-stranded oligonucleotide 
containing the desired site-specific recombination site (e.g., a lox site). 
The double-stranded oligonucleotide may be formed by annealing two 
synthetic single-stranded oligonuceotides to form two ends which are 
compatible with ends of a linearized plasmid vector. The matching ends 
10 may be generated by restriction enzyme digestion or by using cloning 
kits such as the TA cloning kits available from Invitrogen, Inc. (San 
Diego, CA). 

The circular acceptor vector may be any prokaryotic plasmid that 
15 contains a recombination site (e.g. loxP site), a basic backbone of 

plasmid cloning vector such as pBR322, including one or more antibiotic 
resistant genes (e.g. Amp r , Tef) and an origin of replication that function 
in specific host cells. After recombination between the circular donor 
DNA and the acceptor vector to form a fused plasmid, this plasmid 
20 vector can be used to transform bacterial cells. In the transformed cell, 
a prokaryotic promoter, either carried by the donor DNA or the acceptor 
vector, causes expression of the donor DNA under suitable conditions. 

Optionally, the acceptor vector may comprise a prokaryotic 
25 termination sequence. Examples of the prokaryotic termination 

sequence include, but are not limited to, the T7 termination sequence. A 
variety of termination sequences are known to the art and may be 
employed in the nucleic acid constructs of the present invention 
including, the T 1NT , T u , T^, T L3 , TR 1( TR 2 , T 6S termination signals derived 
30 from the bacteriophage □. Hendrix et al. Eds., Cold Spring Harbor 

Press, Cold Spring Harbor, N.Y. (1983) and termination signals derived 
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from bacterial genes such as the trp gene of E. coli. 

The circular acceptor vector may also be a mammalian 
expression vector that contains a recombination site (e.g. loxP site), one 

5 or more eukaryotic marker gene, the appropriate eukaryotic 

transcriptional and translational termination signals and a sequence that 
signals polyadenylation of the transcript messenger RNA (mRNA), and 
an orgin of replication that functions in a mammalian host cell. 
Examples of the eukaryotic polyadenylation sequence include, but are 

1 0 not limited to, the Herpes simplex virus thymidine kinase 

polyadenylation sequence, the bovine growth hormone polyadenylation 
sequence, and the simian virus 40 polyadenylation sequence. 

If the circular acceptor carries a promoter for regulating 
1 5 expression of the donor DNA sequence, the recombination site may be 
placed downstream the promoter and transcription initiation site in 
acceptor vector. This modification of the vector may be easily 
accomplished using synthetic oligonucleotides comprising the desired 
recombination site (loxP site). In designing the oligonucleotide 
20 comprising the recombination site, it may be desirable to avoid 
introducing an ATG or start codon that might initiate translation 
inappropriately, or in-frame stop codons. 

For expression vectors intended to generate a fusion protein 
25 between a protein domain located at the amino-terminus of the fusion 
protein and the protein encoded by the donor DNA, care may be taken 
to place the recombination site in the correct reading frame such that 1) 
an open reading frame is maintained through the recombination site on 
pHOST and 2) the reading frame in the recombination site on the 
30 acceptor vector is in frame with the reading frame found on the 
recombination site contained within the circular donor DNA. 
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Optionally, the eukaryotic expression vector may also carry an 
origin of replication and selectable marker genes that function in 
bacterial cells, forming a shuttle vector. After recombination between 
the circular donor DNA and the acceptor vector to form a fused 
expression vector, this vector can be used to transfect animal cells. In 
the transfected cell, a eukaryotic promoter, either earned by the donor 
DNA or the acceptor vector, causes expression of the donor DNA under 
suitable conditions. 

The circular acceptor may also be a yeast expression vector 
such as a S. cerevisiae expression vector that includes a recombination 
site. Various types of S. cerevisiae expression vector include episomal 
or plasmid vector, integrating vectors, and yeast chromosomes (YACs). 
A YAC-based expression vector may be used to carry large segment of 
donor DNA, which is then maintained as a separate chromosome in the 
host yeast cell. 

The circular acceptor vector may also be a baculovirus DNA 
(genome) that is modified to contain a recombination site (e.g. a loxP 
site). For example, the baculoviral genome of Autographa califomica 
multiple nuclear polyhidrosis virus (AcMNPV) may be modified to 
include a site-specific recombination site, such as a loxP site. A. 
califomica (the alfalfa loop) and over 30 other insect species can be 
infected by AcMNPV. This virus also grows well on many insect cell 
lines, such as Sf cell lines derived from the fall armyworm, Spodoptera 
frugiperda. In these cells, the the promoter of the viral protein, 
polyhedrin, is exceptionally strong when the virus infects the cell. Such 
polyhedrin promoter can promote high level expression of downstream 
foreign gene that replaces the polyhedrin gene. 
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The recombination site (e.g. a loxP site) may be introduced into a 
baculoviral vector through homologous recombination in vivo. For 
example, a baculoviral transfer vector may be used to shuttle the loxP 
site into the wild type baculoviral DNA (e.g. AcMNPV) to form the final 
5 circular baculoviral vector. An example of such transfer vector is the 
p36C that contains baculoviral polyhedrin flanking sequences, up- and 
down-stream AcMNPV DNAs for homologous recombination. Page 
(1989) Nucleic Acids Res. 17: 454. The lopP site may be introduced 
into the transfer vector by regular oligonucleotide-mediated 

1 0 mutagenesis. A portion of the polyhedrin flanking sequence is replaced 
with the loxP site using an oligonucleotide containing the loxP site. The 
the resulting loxP-containing transfer vector and the wild type baculoviral 
DNA are co-transfected into insect cells such as Sf9 cells. Because the 
transfer vector contains the polyhedrin flanking sequences, a double 

15 crossover homologous recombination occurs in the cells, causing the 
replacement of the polyhedrin gene in AcMNPV DNA with the loxP site 
and therefore resulting in the integration of the loxP site into the 
AcMNPV genome. After a desired period of time (e.g. 72 hr) the 
supernatant of the insect cell culture is harvested and the progeny virus 

20 is screened in a standard agarose overlay assay. Brown and Faulkner 
(1977) J. Gen. Virol. 36: 361-364. Polyhedrin-negative plaques are 
purified to homogenity by successive rounds of agarose overlay assay. ' 
The presence of the loxP site may be confirmed by Southern analysis of 
DNA from the infected cells. 

25 

Optionally, the baculoviral acceptor vector according to the 
present invention may not contain the polyhedrin promoter. Instead, the 
polyhedrin promote or a baculoviral p10 promoter is positioned 
upstream of the donor DNA sequence of the circular donor DNA. After 
30 site-specific recombination between the circular donor DNA and a 

baculoviral acceptor vector, the resulting recombinant baculoviral vector 
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Also optionally, the circular acceptor vector according to the 
present invention may include a negatively selectable marker gene, 
5 such as the selectable marker based on herpes simplex virus tk gene or 
any kind of gene encoding a protein that signals apoptosis and causes 
programmed cell death, such as the CAR1 gene. Expression of these 
negatively selectable marker genes results in cell death, thereby 
eliminating those cells containing the circular acceptor vector that does 
1 0 not recombine with the circular donor DNA. 

The present invention also provides kits for generating 
recombinant vectors. In one embodiment, the kit comprises: a double- 
stranded circular donor DNA comprising a donor DNA sequence, a 
1 5 donor recombination site, and at least one selectable marker, the 

circular donor DNA not including an origin of replication; and a circular 
acceptor vector comprising an origin of replication and an acceptor 
recombination site capable of recombining with the circular donor DNA. 

20 In another embodiment the kit comprises: a library of double- 

stranded circular donor DNA comprising a donor DNA sequence which 
varies within a library of donor DNA sequences, a donor recombination 
site, and at least one selectable marker, the circular donor DNA not 
including an origin of replication; and a circular acceptor vector 

25 comprising an origin of replication and an acceptor recombination site 
capable of recombining with the circular donor DNA. 

In yet another embodiment, the kit comprises: one or more linear 
donor DNA comprising a donor DNA sequence; a linear driver DNA 
30 comprising a promoter sequence, a recombination site, and at least one 
selectable marker, ligation of the linear donor DNA and the linear driver 
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DNA resulting in a circular donor DNA; and a circular acceptor vector 
comprising an origin of replication and an acceptor recombination site 
capable of recombining with the circular donor DNA. 

5 

3. Method of Generating Recombinant Expression Vectors 
through Site-Specific Recombination 

The present invention provides a method for generating 
1 0 recombinant expression vector in vitro through site-specific 

recombination between a circular donor DNA and circular acceptor 
DNA, each containing recombination site recognized by the 
recombinase. The method comprises: contacting a circular double- 
stranded donor DNA and a circular acceptor vector in the presence of a 
1 5 recombinase under conditions suitable for the circular double-stranded 
donor DNA and circular acceptor vector to recombine to form a single 
fused circular vector. 

In this method, the circular double-stranded donor DNA 
20 comprises a donor DNA sequence, a donor recombination site, and at 
least one selectable marker, but not including an origin of replication. 
The circular acceptor vector comprises an origin of replication and an 
acceptor recombination site capable of recombining with the circular 
donor DNA. The promoter for regulating expression of the donor DNA 
25 may be contained in either the donor DNA or acceptor vector. 

According to this method, the circular donor DNA containing a 
site-specific recombination site can be recombined with a circular 
acceptor vector in the presence of a site-specific recombinase, such as 
30 Cre recombinase. In vitro Cre recombinase can catalyze fusion of the 
donor DNA into the acceptor vector which contains another copy of the 
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recombination site (e.g. loxP site) found on the circular donor DNA. 
Optionally, the circular donor DNA may contain a promoter positioned 
upstream of the donor DNA sequence to regulate expression of the 
donor DNA once the recombinant vector is introduced into a host cell. 

5 

Alternatively, the promoter for regulating expression of the donor 
DNA sequence may be positioned upstream of the recombination site of 
the acceptor vector. Following the site-specific recombination between 
the recombination sites located on the donor DNA and acceptor vector, 
10 the two circular DNAs are stably fused in a manner that places the 
expression of the donor DNA sequence under the control of the 
promoter contained within the acceptor vector. The recombination 
occurs in a manner that retains the proper translational reading frame of 
the donor DNA. 

15 

Following the in vitro recombination, a portion of the reaction 
mixture may be used to transform, transfect or transduce a suitable host . 
cell to permit the recovery and propagation of the recombinant 
expression vectors. The correctly fused recombinant vector may be 

20 selected for its ability to transform, transfect or transduce a host cell and 
express the selectable marker that is contained in the circular donor 
DNA and recombined into the acceptor vector. The selectable 
phenotype conferred by the selectable marker gene on the recombinant 
vector may be change of color of the host cells upon proper chemical 

25 treatment, secretion of protein in the culture that is detectable by 

colorometers or fluoremeters, survival of cells under selection pressure 
or enhanced propagation of a virus such as baculovirus. The circular 
donor DNA cannot replicate in cells because it does not contain an 
origin of replication, and therefore, unless the circular donor DNA has 

30 integrated into the acceptor vector that contains an origin of expression, 
the circular donor DNA should not replicate in the host cell. The 
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recombinant expression vector may be isolated from host cells which 
display the desired phenotype and verified by using various methods 
such as restriction enzyme digestion, sequencing and Southern blotting. 

5 The present invention also provides a method for generating 

recombinant expression vectors from linear DNA segments in vitro. The 
method comprises: ligating one or more double-stranded linear donor 
DNA which includes a donor DNA sequence with a double-stranded 
linear driver DNA which includes a promoter sequence and a donor 

10 recombination site to form a single circular donor DNA, the singular 
circular donor DNA not including an origin of replication, where the 
donor DNA sequence is under the transcriptional control of the 
promoter; and contacting the circular donor DNA and a circular acceptor 
acceptor vector in the presence of a recombinase to form a single fused 

1 5 circular vector. In this method, the circular acceptor vector comprises 
an origin of replication and an acceptor recombination site capable of 
recombining with the circular donor DNA. 

According to this method, the linear donor DNA and linear driver 
20 DNA may contain matching restriction sites or other type of annealing 

sites so as to be ligated to form a circulaized DNA. After the ligation, the 
circularized donor DNA contains a site-specific recombination site 
carried by the linear driver DNA. Such a circular donor DNA is 
recombined with a circular acceptor vector in the presence of a site- 
25 specific recombinase, such as Cre recombinase. The Cre recombinase 
can catalyze fusion of the donor DNA into the acceptor vector which 
contains another copy of the recombination site (e.g. loxP site) found on 
the circular donor DNA. 

30 Figure 1 illustrates a general scheme of this method according to 

the present invention. As illustrated in Figure 1, a library of linear donor 
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DNA (underlined) is directly ligated to a linear driver DNA (underlined) to 
generate a circular donor DNA. This allows for efficient cloning of donor 
DNA (e.g. cDNA library) into an expression vector in an automated and 
high throughput manner. 

5 

The linear donor and driver DNA can be generated by PCR 
amplification of a template DNA, either linear or contained in plasmids of 
cDNA clones. As illustrated in Figure 1 , a library of cDNA clones from a 
desired source may be rearrayed into 96-well plates by using a re- 

10 arraying robot. The library of cDNA clones may be cosmid and phage 
libraries that contain inserts primarily from single human chromosomes 
isolated by flow-sorting. These cDNA clones may be identified by their 
short sequence tags or expressed sequence tags (ESTs). These cDNA 
clones may be arrayed in multi-well microtiter plates such as 96- or 384- 

15 well plates and handled by robots. For each re-arrayed plate, data 
tracking system may be used to identify each clone, sequence 
accession number, passage number, etc. 

The library of linear donor DNA may also be generated and 
20 amplified from total RNA or mRNA samples by using an RT-PCR 

method. The library of linear donor DNA can then be ligated with the 
linear driver DNA to form the circular donor DNA which is recombined 
with the circular acceptor. This allows direct transferring of the cDNA 
library (donor DNA) to the expression vector (acceptor vector) without 
25 going through a cloning step. 

Alternatively, the library of linear donor DNA may be generated 
by random or site-directed mutagenesis of one or more target gene 
sequence. For example, "poisoned" PCR or DNA shuffling techniques 
30 may be used to generate a diverse library of donor DNA which can be 
incorporated into the circular donor DNA by using direct ligation 
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according to the present Invention. Clones of these mutagenized library 
can be screened for improved or unique biological functions in an 
appropriate expression system. 

5 Bacterial cultures of the cDNA clones may be grown in the multi- 

well microtiter plates and directly used for PCR. Suitable PCR primers 
containing an annealing site may be used to amplify each cDNA clone in 
the plate. For example, PCR primers may be designed to hybridize to 
the sequence of the cDNA clones and contain sequences for Ligation 

10 Independent Cloning (LIC). High-fidelity thermostable polymerases are 
preferably used to reduce copying errors and numbers of amplification 
cycles. The PCR product, the linear cDNA library, may optionally be 
purified by passing through mini columns. 

15 Similarly, the linear driver DNA may also be amplified by using 

primers containing a matching annealing site as the primers for 
amplifying donor DNA. The template of the driver DNA may also be a 
plasmid which contains functional elements, such as a site-specific 
recombination site, a promoter, a selectable marker gene, a tag, and a 

20 transcription termination signal. 

Still referring to Figure 1, the linear driver DNA is annealed and 
ligated with the linear donor DNA under suitable conditions in multiple- 
well plates (e.g. in the presence of T4 ligase). For ligation independent 
25 cloning (LIC), the linear driver DNA and the linear donor DNA generated 
by LIC PCR amplification are digested with T4 polymerase, annealed, 
and ligated together to form a library of circular donor DNAs 
(underlined). 

30 The circular donor DNA may be mixed with a circular acceptor 

vector (underlined) containing a suitable recombination site (e.g. loxP) in 
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the presence of a site-specific recombinase (e.g. Cre recombinase). For 
example, a circular baculoviral vector containing a loxP site may be 
mixed with an array of circular cDNA (circular donor DNA) in the 
presence of purified Cre recombinase or cell extract containing Cre. 

5 The amount of recombinase which is added to drive the recombination 
reaction between the donor and vector DNAs can be determined by 
using a titration assay is used to determine the appropriate amount of 
the purified recombinase enzyme or the appropriate amount of a Cre 
extract. The recombination reaction buffer compatible with purified Cre 

10 recombinase may contain 50 mM Tris-HCI (pH 7.5), 10 mM MgCI 2 , 30 
mM NaCI and 1 mg/ml BSA. The concentration of the circular donor 
DNA and the acceptor vector may vary between 10 ng to 1 0 pg of each 
vector per 20 \x\ reaction volume. The recombination reaction may be 
incubated at 37°C for a necessary period of time and terminated by 

1 5 heating at 70°C for 1 5 min. 

Site-specific recombination between the donor cDNA array and 
the acceptor baculoviral vector in the presence of Cre recombinase 
results in a recombinant baculoviral vector (the vector DNA in Figure 1). 

20 The recombinant baculoviral vector may be used to transfect Sf9 cells 
(the host cell in Figure 1) arrayed in multi-well plates. The plates may 
be incubated at 27°C for several days. The progeny baculovirus may be 
screened in an overlay assay of the supernatant. For example, the 
presence of the integrated donor DNA in the viral progeny may be 

25 determined by detecting level of expression of the selectable marker 

(e.g. p-GAL, GUS). Alternatively, a baculovirus apoptosis resistant gene 
(baculovirus p35 gene) may be used as a selectable marker to confer 
increased viral yield in certain incest cell lines. Clem et al. (1991) 
Science 254: 1388-1390. Recombinant baculovirus bearing this gene 

30 has been shown to be amplified up to a million folds in appropriate host 
cells. Lerch etal (1993) Nucleic Acid Res. 21: 1753-1760. 
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The protein product expressed by the recombinant expression 
vector may be seletively identified and purified by using affinity tagging. 
The tag element may be a short DNA sequence inserted between the 

5 promoter and the transcription termination sequence in a proper reading 
frame on the recombinant expression vector. Such a tag element may 
encodes a short amino acid sequence that specifically binds to a 
compound or a macromolecule. After transcription and translation of the 
recombinant expression vector in the host cells the short amino acid 

1 0 sequence which is part of the donor DNA construct, acts as an 

identification tag (affinity tag). In another word, the tag carried by the 
circular donor DNA that is recombined with the acceptor DNA is 
expressed as a fusion protein encoded by the donor DNA. The 
presence of the fusion protein can be identified through the affinity 

1 5 binding of the tag to its binding partner. 

For example, a polyhistidine tag (e.g. His 6 ) may be used for 
isolating fusion proteins. The fusion protein carrying a His tag can be 
isolated by passing all proteins of a cell extract (e.g. Sf9 cells) through a 

20 column of Ni-triacetic acid ararose beads. The unbound proteins are 
eluted from the column, and the bound fusion protein is selectively 
removed by either adding a competitor compound (e.g. imidazole) that 
dislodges the His tag of the fusion protein from the Ni ions or by lowing 
the pH of the elution buffer. If necessary, the tag may be removed form 

25 the fusion protein with a protease that cleaves only at the engineered 
site. 

Alternatively, the affinity tag may be a protein tag (e.g., GST, 
maltose binding protein) or an epitope tag that are short amino acid 
30 sequences binding to glutathione, maltose and specific antibodies. For 
example, an antibody against an EE epitope tag may be used for 
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identifying and purifying fusion protein encoded by the donor DIMA. The 
EE (Glu-Giu) antipeptide antibody was raised against a peptide that 
contains the major tyrosine phosphorylation site of polyoma middle T 
antigen. Talmage etal. (1989) Cell 59: 55-56. The antibody has 

5 moderate affinity for the EE tag (K„ ~ 2x1 0" 7 ) which allows rapid elution 
of tagged fusion proteins by free peptide under non-denaturing 
conditions while retaining efficient binding of the fusion protein in crude 
lysates. Further, since the EE tag is a strong tyrosine kinase 
phosphorylation substrate for protein kinases (e.g. Src) it may serve as 

10 a detectation label for hig h throughput assays for protein interactions. 

The methods of the present invention allow rapid and efficient 
generation of expression vectors containing the gene of interest without 
bacterial cloning. Various libaries of cDNA or genomic DNA that are 

15 difficult to be cloned in bacteria can be directly amplified and introduced 
into any expression vector. The methods can be used in a wide variety 
of high through arrays for functional genomics, protein genomics and 
other human genome projects. For example, the methods may be used 
for systematic functional analysis gene profiling, gene tagging, gene 

20 overexpression, or systematic transcript analysis. The information 

generated can shed light on functionally important pathways in diseased 
cells in many important areas such as oncology and inflammation. 
Further, the methods may be used for high-throughput genetic screens 
for target discovery and validation, as well as drug discovery based on 

25 the targets discovered in the screens. 

EXAMPLE 

Figure 2 illustrates an example of how to construct a baculoviral 
30 expression vector for the GUS gene according to the present invention. 
As illustrated in Figure 1, a recombinant baculoviral expression vector is 
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generated in vitro without a cloning step. The following is a detailed 
elucidation of the steps of using the method. 

1. Construction of the driver plasmid 

5 

A linear double-stranded driver DNA (underlined) is generated 
from a driver plasmid containing a baculoviral p10 promoter, a Neo 
marker gene, a loxP site and appropriate restrictions sites such as Sfi I 
and Not I sites. The Driver plasmid is constructed by using standard 

1 0 plasmid construction techniques as taught in Sambrook, et al. 

Molecular Cloning: A laboratory Manual; DNA Cloning: A Practical 
Approach, Vol I & II (D. Glover ed.); Oligonucleotide Synthesis (N. Giat, 
ed.). The driver plasmid is based on pBR322, a standard bacterial 
cloning vector plasmid, and the following elements are cloned into 

1 5 pBR322 contiguously, in the order shown below: 

a) an Sfil restriction site (GGCCNNNNNGGCC) 

b) aloxPsite 
(ATAACTTCGTATAATGTATGCTATACGAAGTTAT) 
[SEQ ID NO: 1] 

20 c) a Neo gene with promoter and SV40 poly A addition 

site 

d) a baculovirus p10 promoter 

e) an EE-tag sequence (MEEEEYMPME) [SEQ ID NO: 5] 

f) a Notl restriction site (GCGGCCGC) 

25 

Sequence of the baculoviral p10 promoter is described in Weyer 
U, Possee RD. (1989) J. Gen. Virol. 70:203-8 and is supplied by 
pAcAB4 from BD Pharmingen, San Diego, CA. The sequence for the 
Neo gene with a promoter and SV40 poly A addition site is available 
30 from the plasmid plE-neo supplied by Novagen Inc. 
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2. Amplification and processing of the donor gene DNA 

A iinear double-stranded donor DNA (underlined) is amplified by 
PCR from a donor plasmid or a cDNA clone carrying the GUS gene. 

5 Oligonucleotide PCR primers that are homologous to the 5' and 3' ends 
of the GUS gene (source: pBacPAK8-GUS available from Clontech, 
Palo Alto, CA) are designed and synthesized. These primers also 
contain restriction sites (Notl for the 5' primer and Sfil for the 3* primer) 
in the 5' ends of the primers. The GUS gene is then amplified by PCR 

1 0 from the plasmid pBacPAK8-GUS containing the GUS gene using these 
primers. 

The PCR products from GUS gene amplification are digested 
with Notl and Sfil restriction endonucleases to generate the appropriate 
15 sticky ends on the products, resulting in the linear double-stranded 
donor DNA. 

3. Construction of the circular donor DNA 

20 The driver plasmid described in Section 1 is digested with Notl 

and Sfil restriction endonucleases. The fragment which contains the 
elements listed above such as the loxP site and p10 promoter is isolated 
by gel electrophoresis purification, resulting in the linear double- 
stranded driver DNA. 

25 The linear driver DNA is annealed to an excess of the linear 

donor DNA and ligated by using T4 DNA ligase to produce the circular 
donor DNA (underlined). 

4. Construction of the acceptor baculovirus genomic DNA : 

30 A baculovirus transfer vector plasmid containing a loxP site is 

constructed according to the protocol described by Peakman et al. 
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(1992) Nucleic Acid Res. 20:495-500. Briefly, a double-stranded 
oligonucleotide consisting of a loxP sequence [SEQ ID NO:1] with a 
blunt 5* end and a CTAG 5' overhang at the 3' end is synthesized. A 
baculovirus transfer vector pVL1 392 (available from BD Pharmingen, 

5 San Diego, CA ) is digested with EcoR V and BamH I restriction 

enzymes to remove the polyhedrin promoter in the vector. The double- 
stranded oligonucleotide with the loxP site is ligated with the EcoR 
V/BamH I digested plasmid pVL1392. The resulting ligated plasmid is 
transformed into bacteria and screened for colonies with the loxP insert. 

10 This generates a baculoviral transfer vector containing a loxP site. 

The baculoviral transfer vector containing a loxP site is co- 
transfected with a linearized baculovirus DNA (BacPAK6 DNA from 
Clontech, Palo Alto, CA) by using standard baculovirus construction 
techniques (Methods in Molecular Biology Vol. 39: Baculovirus 

15 Expression Protocols, Christopher D. Ricardson ed., Humana Press, 
Totowa, NJ 1995). 

Recombinant baculovirus that contains the loxP site is isolated by 
plaque purification and baculoviral DNA is prepared from the virus to 
produce the circular acceptor DNA. 

20 

5. Recombination of the circular donor DNA and the circular 
acceptor DNA and selection of the recombinant baculoviral DNA 

The circular acceptor DNA and the circular donor DNA generated 
25 as described above is recombined in the presence of GST-Cre 

recombinase (available from Invitrogen, San Diego, CA) by using the 
methods described by Liu et al. (1998) Current Biology 8:1300-1309. 
The mixture is then heated at 65°C to inactivate the Cre recombinase. 

30 The circular acceptor DNA that has recombined with the circular 

donor DNA is selected and purified from the recombination reaction 

-40- 



WO 02/00875 



PCT/US01/19770 



mixture by using a method of "physical selection" described in Shepard 
& Rae (1 997) Nucleic Acid Research 25:31 83. Briefly, a biotinylated 
synthetic oligonucleotide probe that is substantially homologous to any 
region present in the circular donor DNA is used to hybridize with the 

5 recombined baculoviral DNA. The hybridized complex is then separated 
by using avidin-coated magnetic beads via biotin-avidin high affinity 
binding. The circular donor DNA that is not recombined with the 
acceptor DNA may also be extracted from the mixture. Since the 
circular donor DNA is incapable of replicating in a host insect cell and 

1 0 thus does not interfere with further functional analysis of the recombined 
baculoviral vector. The biotinylated oligonucleotide probe is preferably 
homologous to a GC-rich region of the circular donor DNA. 

6. Propagation and selection of the recombinant baculovirus 

15 

The recombinant baculovirus DNA selected in the above- 
described process is transfected into Sf9 insect cells using standard 
baculovirus construction techniques (Methods in Molecular Biology Vol. 
39: Baculovirus Expression Protocols, Christopher D. Ricardson ed., 
20 Humana Press, Totowa, NJ 1995). 

The recombinant GUS baculovirus is passaged in the presence 
of G418 as described by Lerch and Friessen (1993) Nucleic Acid Res. 
21:1753-1760. This is a positive selection step for propagation of the 
25 recombinant GUS baculovirus containing the Neo gene as a selectable 
marker. 

7. Assessment of recombinant protein production 

30 The GUS activity is assayed by using the following protocol. The 

recombinant GUS baculovirus produced as described above is used to 
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infect Sf9 insect cells. After 2 days the Sf9 cells are lysed in a solution 
of: 20mM Tris pH8, 150 mM NaCI, 0.5% NP40 (10ml of solution per 
gram of cells). The resulting lysate is serially diluted into PBS plus 150 
ug/ml x-Gluc. Blue color in diluted samples indicates GUS activity. The 
5 terminal dilution of the recombinant GUS baculovirus produced by using 
the method of the present invention is compared with that of a 
recombinant GUS baculovirus produced by using conventional 
recombination techniques. 

1 0 GUS protein expressed by the recombinant GUS baculovirus of 

the present invention is visualized by SDS gel electrophoresis and its 
levels of GUS expression are compared with those of GUS expression 
of a recombinant GUS baculovirus produced by using conventional 
recombination techniques. Briefly, Sf9 insect cells are infected with the 

1 5 recombinant GUS baculovirus of the present invention. After 2 days, the 
infected cells are lysed in a solution of: 20mM Tris pH8, 150 mM NaCI, 
0.5% NP40 (10ml of solution per gram of cells). The lysate is loaded 
and run on an SDS-PAGE gel and stained with Coomassie blue. The 
intensity the 75Kd GUS band from the recombinant GUS baculovirus 

20 produced by using the method of the present invention is compared with 
that from a recombinant GUS baculovirus produced by using 
conventional recombination techniques. 

It will be apparent to those skilled in the art that various 
25 modifications and variations can be made in the compounds, 

compositions, kits, and methods of the present invention without 
departing from the spirit or scope of the invention. Thus, it is intended 
that the present invention covers the modifications and variations of this 
invention provided may come within the scope of the appended claims 
30 and their equivalents. 
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CLAIMS 

What is claimed is: 

5 1 . A library of double-stranded circular donor DNA, comprising: 
a donor DNA sequence which varies within a library of donor 
DNA sequences; 

a donor recombination site; and 
a selectable marker, 
1 0 wherein the circular donor DNA does not include an origin of 

replication. 

2. The library of circular donor DNA according to claim 1 , wherein 
the library of donor DNA is a library of cDNA or genomic DNA 

15 

3. The library of circular donor DNA according to claim 2, wherein 
the cDNA library is derived from single human chromosomes. 

4. The library of circular donor DNA according to claim 1 , wherein 
20 the circular donor DNA further comprises a promoter sequence that 

controls expression of the donor DNA sequence. 

5. The library of circular donor DNA according to claim 4, wherein 
the promoter is derived from bacteria, yeast, insect, animal, plant or 

25 virus. 

6. The library of circular donor DNA according to claim 4, wherein 
the promoter is selected from the group consisting of E. coli lac and trp 
operons, the tac promoter, the bacteriophage X p L promoter, 

30 bacteriophage 77 and SP6 promoters, p-actin promoter, insulin 

promoter, human cytomegalovirus (CMV) promoter, HIV-LTR, RSV-LTR, 
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SV40 promoter, baculoviral polyhedrin and p10 promoter. 

7. The library of circular donor DNA according to claim 4, wherein 
the promoter is an inducible promoter. 

8. The library of circular donor DNA according to claim 7, wherein 
the inducible promoter is selected from the group consisting of 
tetracycline, heat shock, steroid hormone, heavy metal, phorbol ester, 
adenovirus E1 A element, interferon, and serum inducible promoters. 

9. The library of circular donor DNA according to claim 1 , wherein 
the donor recombination site is a DNA sequence recognized by a site- 
specific recombinase to facilitate site-specific fusion between the circular 
donor DNA and an acceptor vector containing an acceptor 
recombination site. 

1 0. The library of circular donor DNA according to claim 1 , wherein 
the donor recombination site a recombination site recognized by a 
recombinase, a transposase or an integrase. 

1 1 . The library of circular donor DNA according to claim 1 , wherein 
the donor recombination site is a lox site that is recognized by the Cre 
recombinase of bacteriophage P1 . 

1 2. The library of circular donor DNA according to claim 1 1 , wherein 
the donor recombination site is selected from the group consisting of 
loxB, loxL, loxR, loxP, loxP3, loxP23, loxA86, IoxA117, loxP511,and 
loxC2. 

1 3. The library of circular donor DNA according to claim 1 1 , wherein 
the the donor recombination site is selected from the group consisting of 
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att sites recognized by the Int recombinase of bacteriophage A, a FRT 
site recognized by FLP recombinase of the 2pi plasmid of 
Saccharomyces cerevisiae, a recombination site recognized by the 
resolvase family, and a recombination site recognized by transposase 
5 of Bacillus thruingiensis. 

14. The library of circular donor DNA according to claim 1 , wherein 
the selectable marker is selected from the group consisting of p- 
galactosidase, fluorescent protein, secreted form of human placental 

10 alkaline phosphatase, p-glucuronidase, antibiotic resistance genes, 
yeast seletable markers Ieu2-d and URA3, apoptosis resistant genes, 
and antisense oligonucleotides. 

1 5. The library of circular donor DNA according to claim 1 , wherein 
1 5 the circular donor DNA further includes an affinity tag. 

1 6. The library of circular donor DNA according to claim 1 5, wherein 
the affinity tag is selected from the group consisting of a polyhistidine 
tract, polyarginine, glutathione-S- transferase, maltose binding protein, a 

20 portion of staphylococcal protein A, protein A, and epitope tag. 

17. The library of circular donor DNA according to claim 15, wherein 
the affinity tag is an EE tag. 

25 1 8. The library of circular donor DNA according to claim 1 , wherein 
the circular donor DNA further includes a polyadenylation signal. 

1 9. A double-stranded circular donor DNA, comprising: 
a donor DNA sequence; 
30 a donor recombination site; and 

at least one selectable marker, the circular donor DNA not 
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including an origin of replication. 

20. A circular acceptor vector, comprising: 
an origin of replication; and 

5 an acceptor recombination site capable of recombining with a 

donor DMA, the acceptor vector not including a promoter for regulating 
expression of the donor DNA. 

21 . The circular acceptor vector according to claim 20, wherein the 
1 0 circular acceptor vector is capable of transforming, transfecting or 

transducing a host cell selected from the group consisting of a bacterial, 
yeast, animal, plant, and insect cell. 

22. The circular acceptor vector according to claim 20, wherein the 
1 5 circular acceptor vector is a prokaryotic plasmid. 

23. The circular acceptor vector according to claim 22 further 
comprising: a prokaryotic termination sequence selected from the 
group consisting of the 17 termination sequence, the bacteriophage X 

20 T 1MT , T u , T u , T L3 , TR 1f TR 2 , and T 6S termination signals. 

24. The circular acceptor vector according to claim 1 9, wherein the 
circular acceptor vector is a mammalian expression vector. 

25 25. The circular acceptor vector according to claim 24 wherein the 
mammalian expression vector contains one or more eukaryotic marker 
genes, a eukaryotic transcriptional and translational termination signal 
and a polyadenylation signal. 

30 26. The circular acceptor vector according to claim 24 wherein the 
mammalian expression vector includes an origin of replication and a 
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selectable marker gene that functions in bacterial cells. 

27. The circular acceptor vector according to claim 1 9 wherein the 
circular acceptor vector is a yeast expression vector. 

5 

28. The circular acceptor vector according to claim 27 wherein the 
yeast expression vector is selected from the group consisting of 
episomal vector, plasmid vector, integrating vector, and yeast 
chromosomes. 

10 

29. The circular acceptor vector according to claim 20 wherein the 
vector is a baculoviral vector. 

30. The circular acceptor vector according to claim 28 wherein the 

1 5 baculoviral vector is a modified Autographs califomica multiple nuclear 
polyhedrosis virus. 

31 . The circular acceptor vector according to claim 30 wherein the 
modified Autographs califomica multiple nuclear polyhedrosis virus does 

20 not include a polyhedrin promoter. 

32. A kit for generating a recombinant expression vector, comprising: 
one or more linear donor DNA comprising a donor DNA 

sequence; 

25 a linear driver DNA comprising a promoter sequence, a 

recombination site, and at least one selectable marker, ligation of the 
linear donor DNA and the linear driver DNA resulting in a circular donor 
DNA; and 

a circular acceptor vector comprising an origin of replication and 
30 an acceptor recombination site capable of recombining with the circular 
donor DNA. 



-47- 



WO 02/00875 



PCT/US01/19770 



33. A kit for generating a library of recombinant expression vectors, 
comprising: 

5 a library of double-stranded circular donor DNA comprising a 

donor DNA sequence which varies within a library of donor DNA 
sequences, a donor recombination site, and at least one selectable 
marker, the circular donor DNA not including an origin of replication; and 
a circular acceptor vector comprising an origin of replication and 

10 an acceptor recombination site capable of recombining with the circular 
donor DNA. 

34. A method for generating recombinant expression vector, 
comprising: contacting a circular double-stranded donor DNA and a 

1 5 circular acceptor vector in the presence of a recombinase under 

conditions suitable for the circular double-stranded donor DNA and the 
circular acceptor vector to recombine to form a single fused circular 
vector, 

the circular double-stranded donor DNA comprising a 
20 donor DNA sequence, a donor recombination site, and at least one 
selectable marker, but not including an origin of replication, and 

the circular acceptor vector comprising an origin of 
replication and an acceptor recombination site capable of recombining 
with the circular donor DNA. 

25 

35. The method according to claim 34, wherein the circular donor 
DNA further includes a promoter for regulating expression of the donor 
DNA. 

30 36. The method according to claim 34, wherein the recombination 

sites on the circular donor DNA and the circular acceptor vector are both 
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37. The method according to claim 34 further comprising steps of 
transforming, transfecting or transducing a host cell; and 

5 selecting the correctly fused recombinant vector based on the 

selectable phenotype conferred by the selectable marker gene on the 
recombinant vector. 

38. The method according to claim 37 further comprising: 

1 0 selecting the correctly fused recombinant vector by using a biotin- 

labeled oligonucleotide which is capable of hybridizing with circular 
double-stranded donor DNA. 

39. The method according to claim 34, wherein the recombinase is 
1 5 selected from the group consisting of the bacteriophage P1 Cre 

recombinase, yeast FLP recombinase, Inti integrase, bacteriophage A,, 
phi 80, P22, P2, 186, and P4 recombinase, Tn3 resolvase, the Hin 
recombinase, the Cin recombinase, E. coli xerC and xerD 
recombinases, Bacillus thuringiensis recombinase, Tpnl and the |J- 
20 lactamase transposons, and the immunoglobulin recombinases. 

40. A method for generating recombinant expression vectors, 
comprising: ligating one or more double-stranded linear donor DNA 
which includes a donor DNA sequence with a double-stranded linear 

25 driver DNA which includes a promoter sequence and a donor 

recombination site to form a single circular donor DNA, the circular 
donor DNA not including an origin of replication, the donor DNA 
sequence being under the transcriptional control of the promoter, and 
contacting the circular donor DNA and a circular acceptor vector 

30 in the presence of a recombinase to form a single fused circular vector, 
the circular acceptor vector comprising an origin of replication and an 
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acceptor recombination site capable of recombining with the circular 
donor DNA. 

41 . The method according to claim 40, wherein the linear donor DNA 
5 and linear driver DNA contain matching restriction sites. 

42. A method for generating recombinant expression vectors, 
comprising: ligating a library of double-stranded linear donor DNAs, 
where each member of the library includes a donor DNA sequence, with 

10 a double-stranded linear driver DNA which includes a promoter 

sequence and a donor recombination site to form a single circular donor 
DNA, the single circular donor DNA not including an origin of replication, 
the donor DNA sequence being under the transcriptional control of the 
promoter; and 

1 5 contacting the circular donor DNA and a circular acceptor vector 

in the presence of a recombinase to form a single fused circular vector, 
the circular acceptor vector comprising an origin of replication and an 
acceptor recombination site capable of recombining with the circular 
donor DNA. 

20 

43. The method according to claim 42, wherein the library of double- 
stranded linear donor DNAs are DNAs amplified from a library of cDNA 
clones. 

25 44. The method according to claim 43, wherein the library of cDNA 
clones is arrayed in a multi-well plate. 

45. The method according to claim 43, wherein the library of cDNA 
clones is a cosmid or phage library. 

30 

46. The method according to claim 42, wherein ligating the library of 
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double-stranded linear donor DNAs with a double-stranded linear driver 
DNA is performed by ligation independent cloning. 
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SEQUENCE LISTING 

<110> Clark, Robin 

<120> COMPOSITIONS AND METHODS FOR GENERATING EXPRESSION VECTORS THROU 
GH SITE-SPECIFIC RECOMBINATION 

<130> 12636-239 

<150> US 09/606,323 
<151> 2000-06-28 

<160> 5 



<170> Patentln version 3.0 

<210> 1 

<211> 34 

<212> DNA 

<213> Artificial sequence: loxP 

<400> 1 

ataacttcgt ataatgtatg ctatacgaag ttat 
4 



3 



<210> 2 

<211> 34 

<212> DNA 

<213> Artificial sequence: loxP511 

<400> 2 

ataacttcgt atagtataca ttatacgaag ttat 3 
4 



<210> 3 

<211> 34 

<212> DNA 

<213> Artificial sequence: loxC2 

<400> 3 

acaacttcgt ataatgtatg ctatacgaag ttat 3 
4 



<210> 4 

<211> 34 

<212> DNA 

<213> Artificial sequence: FLP 

<400> 4 
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gaagttccta ttctctagaa agtataggaa cttc 3 
4 



<210> 5 
<211> 10 
<212> PRT 

<213> Artificial sequence: EE-tag 
<400> 5 

Met Glu Glu Glu Glu Tyr Met Pro Met Glu 
15 10 
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